15-712 Project Proposal
نویسندگان
چکیده
In virtually all scientific disciplines, technological advances over the past years and decades have resulted in an exponential growth in the quantity of raw scientific data available for analysis. Massive datasets, ranging from hundreds of MBs to many TBs, are increasingly common and available to the public. Spanning the breadth of scientific fields, this publicly accessible information includes NASA astrophysics and astronomy collections [1], partial three-dimensional mappings of the universe [2], earthquake and seismic data [3], complete genomes for a multitude of organisms [4], and so on. However, collecting and publishing raw data is only the beginning of scientific understanding. Unfortunately, the enormous benefits to be gained by thoroughly analyzing these datasets is typically matched by the complexity of such analysis. For instance, sequence alignment is a powerful technique for predicting the function of newly discovered genes [5], but computational limitations prevent the best algorithms available from being used on full datasets. Supercomputers and customized high-performance solutions such as the CLC Bioinformatics Cube [6] can sometimes employed, but the cost of such solutions is prohibitive. Furthermore, for some classes of scientific problems involving huge datasets (including those mentioned above), processing power can be largely wasted even on a supercomputer if the computation is disk-bound. With the advent of multi-core personal computers, clusters of commodity machines have the potential to become a reasonable alternative platform for intense scientific computing. Using clusters of readily available machines in place of supercomputers or problem-specific high-performance machines could not only reduce costs, power consumption, and possibly execution time, but it would also open the multitude of rich, massive datasets to a wider community of researchers, conceivably accelerating the rate of scientific discovery.
منابع مشابه
A Case for a Flash based Metadata Server Solution in Cluster File Systems 15-712 Project proposal
متن کامل
Remote sensing urban heat-island phenomenon in four Texas cities: San Antonio, Houston, Dallas-Fort Worth, and El Paso
A project proposal is the first step to us to get funding from anywhere. You need to write a proposal to your advisor, your university, your company, your city, state, governmental agencies (USGS, USDA, NSF, NASA, NOAA, DoEd, DOE, ....) for any funding opportunity. A proposal for a small amount of fund is usually 5 pages; for a standard and multi-year proposal to governmental agencies, it is us...
متن کاملچشمه نور ایران، اولین آزمایشگاه ملی برای تحقیقات بین رشتهای
The Iranian Light Source Facility (ILSF) project is the first large scale accelerator facility which is currently under planning in Iran. On the basis of the present design, circumference of the 3 GeV storage ring is 528 m. Beam current and natural beam emittance are 400 mA and 0.477 nm.rad, respectively. Some prototype accelerator components such as high power solid state radio frequency ampli...
متن کاملContemporary methods for evaluating complex project proposals
The ability to evaluate project proposals, assessing future success, and organizational value is critical to overall business performance for most enterprises. Yet, predicting project success is difficult and often unreliable. A four-year field study shows that the effectiveness of available methods for evaluating and selecting large, complex project depends on the specific project type, org...
متن کامل15-712 Systems Final Report Methods for Recognizing Service Quiescence
Our motivation for this project is evaluating our hypothesis that if we had a variety of statistics concerning process resource consumption, we would be able to determine whether and when a process is quiescent.1 We instrument the Linux operating system to allow us to gather relevant statistics, and write a second program, which we call QAnalyzer, that allows us to analyze these statistics, loo...
متن کامل15-740 Project Milestone Report
The ideal choices for the tasks presented in the project proposal would be the NVIDIA CUDA toolkit (http://developer.nvidia.com/object/cuda. html), which exposes more underlying architecture to programmers. However, the package requires a capable NVIDIA video card, and we could not get for this project. ATI also designed a similar platform “Close-to-Metal (CTM) Device” (http://ati.de/companyinf...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007